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ABSTRACT  A  meta-analysis  of  25  epidemiological  studies  estimated  the  prevalence  of  recent  Diagnostic  and 
Statistical  Manual  of  Mental  Disorders-IV  (DSM-IV)  major  depression  (MD)  among  U.S.  military  personnel.  Best 
estimates  of  recent  prevalence  (standard  error)  were  12.0%  (1.2)  among  currently  deployed,  13.1%  (1.8)  among 
previously  deployed,  and  5.7%  (1.2)  among  never  deployed.  Consistent  correlates  of  prevalence  were  being  female, 
enlisted,  young  (ages  17-25),  unmarried,  and  having  less  than  a  college  education.  Simulation  of  data  from  a  national 
general  population  survey  was  used  to  estimate  expected  lifetime  prevalence  of  MD  among  respondents  with  the 
sociodemographic  profile  and  none  of  the  enlistment  exclusions  of  Army  personnel.  In  this  simulated  sample,  16.2% 
(3.1)  of  respondents  had  lifetime  MD  and  69.7%  (8.5)  of  first  onsets  occurred  before  expected  age  of  enlistment. 
Numerous  methodological  problems  limit  the  results  of  the  meta-analysis  and  simulation.  The  article  closes  with  a 
discussion  of  recommendations  for  correcting  these  problems  in  future  surveillance  and  operational  stress  studies. 


INTRODUCTION 

Major  depression  (MD)  is  generally  recognized  to  be  among 
the  most  burdensome  of  all  disorders  in  the  U.S.  population1 
because  of  its  high  prevalence2  and  strong  adverse  effects  on 
role  functioning.3  As  exposure  to  highly  stressful  life  experi¬ 
ences  is  one  of  the  most  consistently  documented  risk  factors 
for  MD,4  it  is  not  surprising  that  exposure  to  combat  has  been 
shown  to  be  a  powerful  predictor  of  MD.5  Indeed,  available 
research  suggests  that  MD  might  be  as  common  as,6  or  per¬ 
haps  even  more  common  than,7  post-traumatic  stress  disorder 
(PTSD)  among  combat  veterans.  Yet,  much  more  research 
has  been  carried  out  on  the  prevalence  and  correlates  of 
PTSD  than  MD  among  military  personnel.5 

In  an  effort  to  synthesize  available  data  on  the  prevalence  of 
MD  and  its  relationship  to  deployment  experience,  we  carried 
out  a  quantitative  literature  review  and  meta-analysis  of  the 
recent  literature  on  the  epidemiology  of  MD  among  U.S.  mil¬ 
itary  personnel.  We  were  mindful  in  planning  this  analysis  that 
a  recent  review  found  a  high  range  of  MD  prevalence  estimates 
in  studies  of  military  personnel.6  The  authors  of  that  review 
cautioned  that  assessments  of  MD  in  the  reviewed  surveys 
were  typically  based  on  unvalidated  screening  scales  rather 
than  clinical  interviews  and  that  many  studies  used  conve¬ 
nience  samples  rather  than  probability  samples.  We  conse¬ 
quently  limited  our  review  to  epidemiological  studies  that. 
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with  a  few  notable  exceptions,  used  probability  sampling 
methods  and  validated  measures  of  MD. 

Despite  considerable  information  in  the  reviewed  studies 
on  current  prevalence  of  MD,  little  data  exist  on  lifetime 
prevalence  or  age-of-onset  of  MD  among  military  personnel. 
Such  data  could  be  valuable  in  determining  if  military  per¬ 
sonnel  with  current  MD  had  first  onsets  before  versus  only 
after  enlistment.  This  information  could  have  important 
implications  in  areas  such  as  large-scale  public  health  inter¬ 
ventions  and  Physical  Evaluation  Boards.  Even  though  direct 
data  on  lifetime  prevalence  are  absent,  simulation  methods 
can  be  used  to  make  indirect  estimates.  Messer  et  al9  did  this 
to  estimate  lifetime  prevalence  of  selected  mental  disorders 
in  the  Army  from  the  Epidemiologic  Catchment  Area  (ECA) 
study,10  a  large  community  epidemiological  survey  of  men¬ 
tal  disorders.  We  extend  the  work  of  Messer  et  al  here  by 
using  similar  methods  to  estimate  lifetime  prevalence  and 
age-of-onset  of  MD.  The  data  used  to  carry  out  this  simula¬ 
tion  are  from  the  National  Comorbidity  Survey  Replication 
(NCS-R),11  a  national  survey  of  the  prevalence  and  corre¬ 
lates  of  DSM-IV  mental  disorders  in  the  civilian  U.S.  house¬ 
hold  population. 

METHODS 

The  Meta-Analysis 

The  Literature  Search  Strategy 

We  searched  PubMed  (NCBI),  Embase  (Elsevier),  and 
PsycINFO  (EBSCO)  for  relevant  studies  published  between 
January  1,  1990  and  April  21,  2011  using  relevant  controlled 
vocabulary  terms  for  (i)  depression,  (ii)  military  personnel,  and 
(iii)  prevalence.  (A  detailed  description  of  the  full  search  strat¬ 
egy  is  available  on  request.)  The  search  yielded  1,216  non- 
duplicated  articles  for  review.  We  also  contacted  leading 

47 


Delivered  by  Publishing  Technology  to:  Guest  User  IP:  131 .84.1 1 .215  on:  Thu,  23  Aug  2012  12:24:56 
Copyright  (c)  Association  of  Military  Surgeons  of  the  U.S.  All  rights  reserved. 


Report  Documentation  Page 

Form  Approved 

OMB  No.  0704-0188 

Public  reporting  burden  for  the  collection  of  information  is  estimated  to  average  1  hour  per  response,  including  the  time  for  reviewing  instructions,  searching  existing  data  sources,  gathering  and 
maintaining  the  data  needed,  and  completing  and  reviewing  the  collection  of  information.  Send  comments  regarding  this  burden  estimate  or  any  other  aspect  of  this  collection  of  information, 
including  suggestions  for  reducing  this  burden,  to  Washington  Headquarters  Services,  Directorate  for  Information  Operations  and  Reports,  1215  Jefferson  Davis  Highway,  Suite  1204,  Arlington 

VA  22202-4302.  Respondents  should  be  aware  that  notwithstanding  any  other  provision  of  law,  no  person  shall  be  subject  to  a  penalty  for  failing  to  comply  with  a  collection  of  information  if  it 
does  not  display  a  currently  valid  OMB  control  number. 

1.  REPORT  DATE 

2012 

2.  REPORT  TYPE 

3.  DATES  COVERED 

00-00-2012  to  00-00-2012 

4.  TITLE  AND  SUBTITLE 

Prevalence  of  DSM-IV  Major  Depression  Among  U.S.  Military 

Personnel:  Meta-Analysis  and  Simulation 

5a.  CONTRACT  NUMBER 

5b.  GRANT  NUMBER 

5c.  PROGRAM  ELEMENT  NUMBER 

6.  AUTHOR(S) 

5d.  PROJECT  NUMBER 

5e.  TASK  NUMBER 

5f.  WORK  UNIT  NUMBER 

7.  PERFORMING  ORGANIZATION  NAME(S)  AND  ADDRESS(ES) 

Uniformed  Services  University  of  the  Health  Sciences, 4301  Jones  Bridge 
Road, Bethesda,MD, 20814 

8.  PERFORMING  ORGANIZATION 

REPORT  NUMBER 

9.  SPONSORING/MONITORING  AGENCY  NAME(S)  AND  ADDRESS(ES) 

10.  SPONSOR/MONITOR'S  ACRONYM(S) 

11.  SPONSOR/MONITOR'S  REPORT 
NUMBER(S) 

12.  DISTRIBUTION/AVAILABILITY  STATEMENT 

Approved  for  public  release;  distribution  unlimited 

13.  SUPPLEMENTARY  NOTES 

MILITARY  MEDICINE,  177,  8:47,  August  Supplement,  2012 

14.  ABSTRACT 

A  meta-analysis  of  25  epidemiological  studies  estimated  the  prevalence  of  recent  Diagnostic  and  Statistical 
Manual  of  Mental  Disorders-IV  (DSM-IV)  major  depression  (MD)  among  U.S.  military  personnel.  Best 
estimates  of  recent  prevalence  (standard  error)  were  12.0%  (1.2)  among  currently  deployed,  13.1%  (1.8) 
among  previously  deployed,  and  5.7%  (1.2)  among  never  deployed.  Consistent  correlates  of  prevalence 
were  being  female  enlisted,  young  (ages  17?25),  unmarried,  and  having  less  than  a  college  education. 
Simulation  of  data  from  a  national  general  population  survey  was  used  to  estimate  expected  lifetime 
prevalence  of  MD  among  respondents  with  the  sociodemographic  profile  and  none  of  the  enlistment 
exclusions  of  Army  personnel.  In  this  simulated  sample,  16.2%  (3.1)  of  respondents  had  lifetime  MD  and 
69.7%  (8.5)  of  first  onsets  occurred  before  expected  age  of  enlistment.  Numerous  methodological  problems 
limit  the  results  of  the  meta-analysis  and  simulation.  The  article  closes  with  a  discussion  of 
recommendations  for  correcting  these  problems  in  future  surveillance  and  operational  stress  studies. 


15.  SUBJECT  TERMS 


16.  SECURITY  CLASSIFICATION  OF: 

17.  LIMITATION  OF 

ABSTRACT 

18.  NUMBER 

OF  PAGES 

19a.  NAME  OF 

RESPONSIBLE  PERSON 

a.  REPORT 

unclassified 

b.  ABSTRACT 

unclassified 

c.  THIS  PAGE 

unclassified 

Same  as 
Report  (SAR) 

14 

Standard  Form  298  (Rev.  8-98) 

Prescribed  by  ANSI  Std  Z39-18 


Prevalence  ofDSM-IV  MD  Among  U.S.  Military  Personnel 


researchers  in  the  epidemiology  of  mental  disorders  in  the 
military  for  additional  studies  and  searched  for  relevant 
reports.  We  focused  on  studies  with  a  sample  size  of  at  least 
1,000  individuals  that  provided  estimates  of  recent  prevalence 
of  DSM-IV  MD  based  on  a  validated  screening  measure  (with 
demonstrated  concordance  to  a  diagnostic  interview)  or  a  diag¬ 
nostic  interview  in  a  probability  sample  of  individuals  currently 
(at  the  time  of  the  survey)  serving  in  the  U.S.  Armed  Forces. 
Studies  that  focused  on  clinical  populations  were  excluded. 

Two  independent  raters  reviewed  the  abstracts  and,  based 
on  the  inclusion  criteria,  identified  32  articles  and  13  reports 
for  detailed  review.  Nineteen  of  these  studies  were  subse¬ 
quently  excluded  because  they  did  not  meet  the  inclusion 
criteria  or  used  the  same  data  as  another  publication  that 
was  included.  (A  detailed  description  is  available  on  request). 
There  were  26  remaining  articles  and  reports  (Table  I).  One12 
used  a  subsample  of  a  larger  dataset. lj  Because  detailed 
information  on  MD  by  demographic  characteristics  was  only 
available  in  the  former,1^  that  study  was  included  in  the 
analysis  examining  sociodemographic  correlates  of  MD  but 
not  in  the  regression  analyses.  We  included  several  studies  in 
which  MD  was  assessed  with  a  version  of  the  Patient  Health 
Questionnaire  (PHQ)-9  that  included  an  additional  require¬ 
ment  of  self-reported  functional  impairment  even  though  we 
were  unable  to  find  a  validation  study  for  this  version  of  the 
PHQ-9.  This  was  done  because,  as  described  below,  we  were 
able  to  develop  a  calibration  for  this  version  of  the  scale  that 
approximated  the  more  standard  version.  With  regard  to 
random  sampling,  none  of  the  studies  considered  here  was 
based  on  an  unrestricted  probability  sample  of  the  entire 
Army.  The  baseline  Millennium  Cohort  Study14  and  the  peri¬ 
odic  Department  of  Defense  Surveys  of  Health-Related 
Behaviors  among  Military  Personnel 15-1 9  were  based  on  rep¬ 
resentative  samples  of  military  personnel  who  were  not 
deployed  at  the  time  of  sample  selection,  whereas  the  other 
studies  we  refer  to  as  probability  surveys  were  based  on 
samples  of  military  personnel  in  more  restrictive  sampling 
frames,  but  in  each  case  either  selected  a  probability  sample 
of  military  personnel  from  the  frame  or  attempted  to  survey 
all  personnel  in  the  selected  units  or  time  periods.  Several 
reports  did  not  use  probability  sampling  methods,20-23  but 
were  included  because  they  contained  prevalence  estimates 
for  individuals  currently  deployed  at  the  time  of  data  collection 
that  otherwise  would  have  been  strongly  underrepresented  in 
our  analyses.  A  small  number  of  the  studies  included  respon¬ 
dents  in  the  National  Guard  or  Reserves  who  might  have  been 
recently  deactivated  at  the  time  of  data  collection. 

Each  of  the  retained  studies  was  entered  into  a  data  file 
for  quantitative  analysis.  The  variables  included  the  preva¬ 
lence  estimate,  the  measure  on  which  the  prevalence  esti¬ 
mate  was  based  (see  below),  the  sample  size,  information 
about  whether  the  assessment  was  anonymous  or  not,  and 
the  deployment  status  of  respondents  at  the  time  of  data 
collection  (currently  deployed,  previously  deployed,  or  never 
deployed).  A  study  was  coded  as  anonymous  only  if  this 


was  explicitly  stated.  Studies  coded  not  anonymous  included 
confidential  surveys  in  which  identifying  information  was 
available  but  not  disclosed  to  anyone  not  connected  to  the 
research  and  surveys  that  were  mandatory  for  all  service 
members  returning  from  deployment,  which  were  maintained 
in  the  permanent  medical  record.  In  cases  where  a  single 
study  included  respondents  with  more  than  one  deployment 
status  and  a  MD  prevalence  estimate  was  presented  sepa¬ 
rately  by  deployment  status,  the  subsamples  with  different 
deployment  statuses  were  treated  as  separate  samples  and 
entered  as  distinct  observational  records  in  the  data 
file.  ’  In  cases  where  the  study  included  respondents 
with  more  than  one  deployment  status  but  MD  prevalence 
was  not  reported  by  deployment  status,  we  treated  the  study 
as  a  single  observational  record  and  entered  information 
in  the  data  file  for  the  proportions  of  respondents  that 
were  currently  deployed,  previously  deployed,  and  never 
deployed.  In  cases  where  these  proportions  were  not  reported 
in  the  study,  they  were  estimated  based  on  the  best  available 
information.  (A  detailed  description  of  the  estimation  meth¬ 
ods  is  available  on  request.)  The  majority  of  assessments 
across  studies  were  from  respondents  who  had  previously 
deployed  (83.8%).  Smaller  proportions  had  never  deployed 
(14.4%)  or  were  currently  deployed  (1.8%)  at  the  time  of  data 
collection.  Studies  that  assessed  MD  longitudinally31  or  cross- 
sectionally  at  two  time  points  (3  and  12  months  post¬ 
deployment)13  were  treated  as  separate  observational  records. 
The  subsamples  in  one  especially  large  study26  were  also 
treated  as  separate  observational  records.  This  resulted  in 
a  total  of  37  observational  records  in  the  final  data  file  repre¬ 
senting  a  total  of  712,698  assessments. 

Measures  ofMD  in  the  Reviewed  Studies 

The  measures  of  MD  in  the  reviewed  studies  include  the 
PHQ-2,40  PHQ-8,41  PHQ-9  with  a  severity  coding  scheme,42 
PHQ-9  with  a  DSM-IV  coding  scheme  (PHQ-9/DSM-IV),34 
PHQ-9/DSM-IV  with  an  impairment  requirement,27  the 
three-item  version  A  Burnam  depression  screen,43  the  eight- 
item  Burnam  depression  screen,44  and  the  Center  for  Epide¬ 
miologic  Studies  Depression  Scale  (CES-D).45  Recall  periods 
were  1  week,  2  weeks,  or  1  month  before  interview  (Table  I). 
These  recall  periods  were  treated  as  equivalent  in  assessing 
recent  prevalence  for  the  analysis.  All  the  measures  are 
screening  scales;  that  is,  although  they  assess  some  of  the 
key  symptoms  of  DSM-IV  MD,  they  are  designed  to  generate 
quick  estimates  of  possible  diagnosis  rather  than  definitive 
diagnoses.  We  were  unable  to  identify  any  studies  using 
diagnostic  interviews  that  met  our  inclusion  criteria. 

Quantitative  Analysis  of  the  Reviewed  Studies 
Quantitative  analysis  was  carried  out  to  examine  effects 
of  methodological  factors  (the  measure  on  which  the  preva¬ 
lence  estimate  was  based,  sample  size,  anonymous  versus  not 
anonymous)  and  deployment  status  on  MD  prevalence  esti¬ 
mates.  Random-effects  multilevel  regression  analysis  was 
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TABLE  I.  Description  of  Studies  Included  in  the  Analyses 


Study 

ID 

A 

Branch 

N  AF 

a 

M 

CG 

Deployment  Status^ 

PD  CD  ND 

Anonymousc 

Measure^ 

RP‘ 

RR7  (%) 

Prevalence 
of  MD  (%) 

Years^ 

00* 

Citation 

la 

X 

X 

X 

Yes 

5 

3 

54' 

6.0 

2003 

4,529 

Cabrera  200724 

lb 

X 

X 

Yes 

5 

3 

54' 

7.8 

2004 

2,392 

Cabrera  200724 

2a 

X 

X 

X 

Yes 

4 

3 

77 

11.4 

2003 

2,418 

Hoge  200425 

2b 

X 

X 

X 

Yes 

4 

3 

53 

14.5 

2003 

3,500 

Hoge  200425 

3a7 

X 

X 

X 

No 

1 

2 

+* 

3.5 

2003-04 

16,318 

Hoge  200626 

3b/ 

X 

X 

X 

No 

1 

2 

6.1 

2003-04 

222,620 

Hoge  200626 

3c7 

X 

X 

X 

No 

1 

2 

+* 

2.7 

2003-04 

64,967 

Hoge  200626 

4 

X 

X 

Yes 

5 

3 

59 

5.3 

2006 

2,464 

Hoge  200827 

5a 

X 

X 

X 

Yes 

4 

2 

89 

2.0 

2007-08 

1,910 

Kline  201028 

5b 

X 

X 

Yes 

4 

2 

89 

5.1 

2007-08 

625 

Kline  201028 

6 

X 

X 

No 

6 

1 

48 

37.4 

2005 

4,089 

Lapierre  200729 

7a 

X 

X 

X 

No 

3 

2 

+* 

4.5 

2004-08 

6,943 

Luxton  20 1030 

7b 

X 

X 

No 

3 

2 

10.1 

2006-09 

6,943 

Luxton  20 1030 

8a' 

X 

X 

No 

1 

2 

+* 

4.7 

2004-06 

56,350 

Milliken  200731 

8b' 

X 

X 

No 

1 

3 

+* 

10.3 

2005-06 

56,350 

Milliken  200731 

8c' 

X 

X 

No 

1 

2 

+* 

3.8 

2004-06 

31,885 

Milliken  200731 

8d' 

X 

X 

No 

1 

3 

+* 

13.0 

2005-06 

31,885 

Milliken  200731 

9 

X 

X 

No 

4 

2 

+* 

4.2 

2005-07 

1,301 

Reger  200933 

10 

X 

X 

X 

X 

X 

X 

X 

No 

4 

2 

36 

3.2 

2001-03 

76,476 

Riddle  200714 

1  la"1 

X 

X 

Yes 

5 

3 

58 

5.0 

2005-07 

2,454 

Riviere  2011 12 

lib'" 

X 

X 

Yes 

5 

3 

71 

7.3 

2005-07 

1,415 

Riviere  2011 12 

12a" 

X 

X 

Yes 

4 

3 

62 

16.0 

2004-07 

4,723 

Thomas  201013 

12b" 

X 

X 

Yes 

4 

3 

62 

15.7 

2004-07 

3,749 

Thomas  201013 

12c" 

X 

X 

Yes 

4 

3 

62 

11.5 

2004-07 

2,607 

Thomas  201013 

12d" 

X 

X 

Yes 

4 

3 

62 

15.9 

2004-07 

1,501 

Thomas  201013 

13 

X 

X 

Yes 

3 

2 

91 

15.9 

2004 

1,090 

Warner  200735 

14 

X 

X 

X 

X 

X 

X 

Yes 

7 

1 

70 

17.6 

1995 

16,193 

Bray  199515 

15 

X 

X 

X 

X 

X 

X 

Yes 

7 

1 

59 

16.1 

1998 

17,264 

Bray  199916 

16 

X 

X 

X 

X 

X 

X 

Yes 

8 

1 

56 

18.8 

2002-03 

12,756 

Bray  200317 

17 

X 

X 

X 

X 

X 

X 

Yes 

7 

1 

52 

22.3 

2005 

16,146 

Bray  2006 18 

18 

X 

X 

X 

X 

X 

X 

X 

Yes 

7 

1 

72 

21.1 

2008 

28,546 

Bray  200919 

19 

X 

X 

Yes 

5 

3 

++° 

5.0 

2004 

2,064 

MHAT-II  200520 

20 

X 

X 

Yes 

5 

3 

++° 

8.0 

2005 

1,124 

MHAT-III  200621 

21 

X 

X 

X 

Yes 

5 

3 

++° 

7.7 

2006 

1,767 

MHAT-IV  200622 

22 

X 

X 

Yes 

5 

3 

++° 

7.2 

2007 

3,114 

MHAT-V  200823 

23 

X 

X 

Yes 

5 

3 

++° 

4.8 

2009 

1,360 

MHAT-VI  2009a36 

24 

X 

X 

Yes 

5 

3 

++° 

4.9 

2008-09 

2,442 

MHAT-VI  2009b37 

25 

X 

X 

X 

Yes 

5 

3 

++° 

4.8 

2010 

1,246 

MHAT-VII  2011 38 

HP 

X 

X 

X 

X 

X 

No 

2 

2 

44 

10.8 

2007-08 

1,041 

Schell  200839 

With  the  exception  of  studies  19  to  22,  the  studies  used  probability  sampling  methods  to  select  samples. 

"A,  Army;  N,  Navy;  AF,  Air  Force;  M,  Marines;  CG,  Coast  Guard. 

foPD,  Previously  deployed;  CD,  Currently  deployed;  ND,  Never  deployed.  83.8%  of  assessments  across  studies  were  from  respondents  who  had  previously 
deployed,  14.4%  never  deployed,  and  1.8%  currently  deployed. 
cSee  the  text  for  the  definition  of  anonymity. 

rfPHQ-2  =  1,  PHQ-8  =  2,  PHQ-9  =  3,  PHQ-9/DSM-IV  =  4,  PHQ-9/DSM-IV  +  Functional  impainnent  (FI)  =  5,  CES-D  =  6,  3-item  version  A  Bumam 
depression  screen  =  7,  8-item  Bumam  depression  screen  =  8.  (Information  on  the  cut-points  used  for  these  scales  is  available  on  request.) 

‘RP,  Recall  period:  1  week  =1,2  weeks  =  2,  1  month  =  3. 

^RR,  Response  rate.  Although  RR  is  reported  based  on  the  information  provided  in  the  publications,  it  should  be  noted  that  there  is  wide  variation  in  the  ways 
response  rates  are  reported  in  the  literature.  Even  though  standards  for  reporting  response  rates  exist  (http://www.aapor.org/Response_Rates_An_Overviewl 
.htm),  few  of  the  studies  included  in  our  analysis  reported  information  about  their  response  rate  calculation  methods  in  enough  detail  to  tell  which  of  these 
definitions  they  used. 
gYears:  Year(s)  when  data  were  collected. 

*If  available,  the  sample  size  is  reported  as  the  number  of  respondents  actually  screened  for  depression  (which  in  some  cases  is  different  from  the  total  number 
of  respondents  included  in  analyses  in  the  studies). 

'This  is  an  estimate  based  on  personal  communication  with  Dr.  Hoge  (September  20,  2011). 

1 3a  is  a  subsample  of  respondents  surveyed  after  return  from  Operation  Enduring  Freedom,  3b  from  Operation  Iraqi  Freedom,  and  3c  from  other  locations. 
^Studies  3  and  7  to  9  are  based  on  the  mandatory  Postdeployment  Health  (Re)Assessment  or  equivalents  and  response  rates  were  not  reported. 

78a  and  8b  are  longitudinal  assessments  of  a  sample  of  active  duty  soldiers  (approximately  6  months  between  assessments);  8c  and  8d  are  equivalent  for  a 
sample  of  National  Guard  and  Reserve  soldiers. 

(continues) 
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used  to  analyze  the  data.46  This  is  the  preferred  method  for 
quantitative  meta-analysis  because  it  allows  information 
about  both  sample  size  and  study  characteristics  to  be 
included  in  the  analysis.  The  random  effects  model  includes 
terms  both  for  sampling  error  (sample-size  dependent)  and 
model  error  (representing  effects  of  study-specific  variation 
independent  of  sample  size,  such  as  unobserved  variations  in 
measurement  methods,  population,  and  context).  In  this  way, 
the  model  gives  more  weight  to  larger  than  smaller  studies 
but  does  not  allow  any  single  very  large  study  to  swamp  the 
effect  of  smaller  studies  because  weighting  takes  into  consid¬ 
eration  the  extent  to  which  each  observation  deviates  from  the 
overall  pattern  in  the  full  sample  and  down-weights  observa¬ 
tions  that  have  large  deviations.  The  analysis  was  carried  out 
initially  with  the  observed  study  prevalence  estimate  as  the 
outcome  and  subsequently  repeated  with  recalibrated  mea¬ 
sures  of  prevalence  described  below.  The  coefficients  in  these 
models  were  then  used  to  estimate  the  prevalence  of  DSM-IV 
MD  separately  for  deployed,  previously  deployed,  and  never 
deployed  military  personnel  by  generating  a  predicted  preva¬ 
lence  estimate  from  the  model  coefficients  separately  for 
respondents  in  each  of  the  three  deployment  statuses  based  on 
assumptions  about  calibrations  used  to  equalize  estimates 
across  types  of  measures  described  below.  As  we  found  that 
anonymity  of  reports  is  significantly  related  to  elevated  preva¬ 
lence  estimates,  the  predicted  prevalence  estimates  were  made 
based  on  the  assumption  that  MD  was  assessed  in  an  anony¬ 
mous  survey.  Standard  errors  of  the  prevalence  estimates  were 
generated  using  the  jackknife  resampling  method.47 

The  Simulation 

The  Sample 

As  noted  in  the  introduction,  the  simulation  study  was  based 
on  the  NCS-R,11  a  2001-2003  national  face-to-face  survey 
of  the  prevalence  and  correlates  of  DSM-IV  mental  disorders 
in  the  adult  (ages  18+)  civilian  U.S.  household  population. 
The  response  rate  was  70.9%.  The  interview  was  conducted 
in  two  parts.  Part  1,  completed  by  all  9,282  respondents, 
assessed  a  core  set  of  DSM-IV  mental  disorders,  whereas  Part  2, 
administered  to  all  Part  1  respondents  who  screened  positive 
for  at  least  one  Part  1  disorder  (n  =  4,235)  plus  a  probability 
subsample  of  other  Part  1  respondents  (n  =  1,457),  assessed 
additional  disorders  and  correlates.  The  Part  2  sample  was 
weighted  to  adjust  for  differential  probabilities  of  selection  and 
the  undersampling  of  respondents  with  no  Part  1  disorder.  A  final 
poststratification  weight  was  used  to  match  the  Part  2  sample 
to  the  2000  census  on  a  variety  of  sociodemographic  and  geo¬ 


graphic  variables.  These  sampling  and  weighting  procedures  are 
discussed  in  more  detail  elsewhere.48 

Sample  Matching 

We  subsampled  Part  2  NCS-R  respondents  to  create  a  weighted 
subsample  that  matched  the  population  of  active  duty  Army 
personnel  as  closely  as  possible.  We  focused  on  Army  personnel 
rather  than  military  personnel  more  generally  because  the  sim¬ 
ulation  was  carried  out  as  part  of  planning  for  the  Army  Study 
To  Assess  Risk  and  Resilience  in  Servicemembers  (Army 
STARRS;  http://www.amiystarrs.org). 

Subsampling  began  by  limiting  NCS-R  respondents  to  those 
in  the  age  range  18  to  65  with  at  least  a  high  school  education  (or 
general  education  diploma  [GED])  who  were  employed  and  had 
health  insurance.  We  then  excluded  respondents  who  would  be 
ineligible  for  Army  service  based  on  (1)  conviction  of  a  felony 
or  serving  at  least  1  year  in  prison;  (2)  handicaps,  including  deaf¬ 
ness,  blindness,  paralysis,  or  a  missing  limb;  (3)  chronic  physical 
disorders,  including  cardiovascular  disorders  (heart  attack,  stroke, 
hypertension,  heart  disease),  respiratory  disorders  (COPD, 
asthma),  diabetes,  ulcer,  HIV-AIDS,  epilepsy  or  seizure  disor¬ 
der,  Crohn’s  disease,  cancer  (except  skin  cancer),  severe  migraines, 
and  extreme  obesity;  and  (4)  severe  mental  disorders,  including 
schizophrenia,  other  nonaffective  psychoses,  bipolar  (BP)-I  dis¬ 
order,  and  serious  suicide  attempts  that  occurred  before  the 
imputed  age  of  enlistment.  These  exclusions  are  overinclusive 
in  that  they  remove  people  who  might  have  entered  the  Army 
with  waivers  or  developed  chronic  conditions  after  enlistment. 

Once  the  NCS-R  sample  was  restricted  in  these  ways,  we 
selected  a  series  of  eight  weighting  variables  available  in  the 
NCS-R  and  the  Defense  Department  Defense  Manpower 
Data  Center  (DMDC)  master  personnel  dataset  for  Army 
personnel  who  were  on  active  duty  in  December  2007  (http:// 
www.virec.research.va.gov/Non-VADataSources/DMDC.htm). 
The  eight  weighting  variables  were  age,  sex,  race-ethnicity 
(non-Hispanic  black,  non-Hispanic  white,  Hispanic,  and  all 
others),  education  (high  school  graduates  including  those  with 
a  GED,  some  post-high  school  education  without  a  bachelor’s 
degree,  and  bachelor’s  degree  or  more  education),  marital  status 
(married,  never  married,  and  previously  married),  U.S.  citizen¬ 
ship  (yes  or  no),  nativity  (i.e.,  born  in  the  United  States  yes  or 
no),  and  religion  (Protestantism,  Catholicism,  Judaism,  Eastern 
[Buddhism,  Hinduism,  Islam],  other,  and  atheist  or  no  religion). 
These  variables  were  selected  for  weighting  because  they  are 
known  to  be  significantly  related  to  mental  disorders  and  to 
have  a  significantly  different  distribution  among  Army  personnel 
than  the  general  U.S.  population,  although  coarseness  of  some 
weighting  categories  makes  the  matching  inexact. 


Table  I  (continued) 

"'The  data  used  in  Riviere  et  al.12  (collected  at  3  months  [11a]  and  12  months  [lib]  postdeployment)  are  a  subsample  of  the  data  used  in  Thomas  et  al,13  but 
the  former  is  used  to  calculate  ORs  because  it  reports  additional  information  about  prevalence  by  demographic  characteristics. 

"Cross-sectional  samples  of  Active  Component  and  National  Guard  soldiers  were  assessed  at  3  months  (12a  and  12c,  respectively)  and  12  months  (12b  and 
12d,  respectively). 

"No  response  rates  are  available  for  studies  19  to  25  (Dr.  Bliese,  personal  communication,  August  29,  2011). 

^Respondents  who  were  discharged  or  retired  were  excluded. 
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Prevalence 


FIGURE  1.  Inverse  cumulative  distribution  function  of  MD  prevalence  based  on  studies  included  in  the  analyses  (based  on  37  observational  records 
containing  a  total  of  712,698  assessments). 


The  NCS-R  weights  were  generated  by  using  an  exponen¬ 
tial  weighting  function  to  make  the  distributions  of  the  eight 
weighting  variables  in  the  adjusted  NCS-R  sample  agree  with 
the  distributions  in  the  DMDC  dataset.  (A  detailed  descrip¬ 
tion  of  weighting  procedures  is  available  on  request.) 

Regression-based  imputation  was  used  to  assign  an  esti¬ 
mated  age  of  enlistment  to  each  NCS-R  respondent  by  esti¬ 
mating  a  multiple  regression  equation  using  cross-tabulations 
of  weighting  variables  from  the  DMDC  in  which  the  eight 
variables  were  used  to  predict  age  of  enlistment.  The  regres¬ 
sion  coefficients  from  that  equation  were  then  applied  to  the 
NCS-R  dataset  to  impute  individual-level  estimates  of  age  of 
enlistment  to  match  the  DMDC  distribution  conditional  on 
the  matching  variables. 

Measurement  ofDSM-IV  MD  in  the  NCS-R 
MD  was  assessed  in  the  NCS-R  with  Version  3.0  of  the  World 
Health  Organization  Composite  International  Diagnostic  Inter¬ 
view  (CIDI),32  a  fully  structured  lay-administered  interview 
that  generates  diagnoses  for  commonly  occurring  DSM-IV 
mental  disorders.  Good  concordance  was  found  between  CIDI 
diagnoses  and  blinded  clinical  assessments  in  a  NCS-R  clinical 
reappraisal  study.49  The  CIDI  yields  information  on  lifetime 
history,  age  at  first  onset,  and  presence  of  MD  in  the  past  year. 
Diagnoses  were  assigned  based  on  reports  of  symptoms,  dura¬ 
tion,  and  intensity  as  specified  in  the  DSM-IV. 

RESULTS 

The  Meta-Analysis 

MD  Prevalence  Estimates  in  the  Reviewed  Studies 
MD  prevalence  estimates  vary  widely  across  the  37  observa¬ 
tional  records,  from  a  low  of  2.0%  to  a  high  of  37.4%  (Fig.  1). 
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Figure  1  shows  the  proportion  of  studies  with  prevalence 
estimates  above  the  levels  on  the  horizontal  axis.  There  are 
four  different  lines  in  the  figure  based  on  the  cross-classification 
of  two  dichotomous  distinctions:  (i)  either  using  a  weight  to 
adjust  the  37  different  studies  for  variation  in  sample  size  or 
treating  the  studies  as  equal  in  importance  regardless  of  sam¬ 
ple  size  and  (ii)  either  using  a  calibration  method  described 
below  to  adjust  prevalence  estimates  or  considering  preva¬ 
lence  estimates  in  the  metrics  reported  in  Table  I.  For  current 
purposes,  the  reader  should  focus  only  on  the  two  lines  with¬ 
out  calibration. 

The  upper  left  corner  of  the  figure  shows  that  100%  of 
studies,  by  definition,  have  a  prevalence  estimate  of  0.0%  or 
more.  The  line  for  the  unweighted  (for  variation  in  sample 
size  across  studies)  and  uncalibrated  distribution  across  stud¬ 
ies  shows  that  median  prevalence  (i.e.,  the  prevalence  of  the 
study  with  the  19th  highest  prevalence  out  of  the  37  studies) 
is  7.8%,  the  mean  10.3%,  and  the  interquartile  range  (IQR: 
25th-75th  percentiles)  is  4.8  to  15.7%.  Weighting  for  varia¬ 
tion  in  sample  size  across  observations  substantially  reduces 
estimates  both  of  central  tendency  (median  from  7.8  to  6.1%; 
mean  from  10.3  to  8.0%)  and  spread  (IQR  from  4.8  to  15.7% 
to  3.8  to  10.3%). 

One  possible  reason  for  the  wide  variation  in  these  preva¬ 
lence  estimates  is  that  the  different  screening  scales  might 
differ  in  sensitivity  and  specificity.  Published  validity  studies 
are  consistent  with  this  possibility,  suggesting  that  the  PHQ-9 
with  severity  scoring  has  the  highest  sensitivity  and  that  the 
eight-item  Burnam  scale  has  the  highest  specificity  (Table  II). 
These  validation  studies  typically  administered  a  gold  stan¬ 
dard  clinical  reappraisal  interview  to  a  probability  subsample 
of  people  shortly  after  they  were  administered  the  screening 
scale.  The  clinical  interviewers  typically  were  blinded  to  the 
screening  scale  scores.  These  validity  studies  were  conducted 
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TABLE  II.  Sensitivity  and  Specificity  of  Screening  Measures  Compared  to  Diagnostic  Interviews 


Measures" 

Sensitivity 

(%> 

Specificity 

(%) 

(«) 

Criterion" 

Reference 

PHQ-2 

83 

90 

580* 

Overview  of  SCID-DSM-III-R  and 

Diagnostic  Questions  from  PRIME-MD, 

Major  Depressive  Disorder,  Past  Month 

Kroenke  et  al  (2003 )40 

PHQ-9 

88 

88 

580* 

Overview  of  SCID-DSM-III-R  and 

Diagnostic  Questions  from  PRIME-MD, 

Major  Depressive  Disorder,  Past  Month 

Kroenke  et  al  (200 1)42 

PHQ-9/DSM-IV 

77 

94 

3,001* 

Formal  Interview  Based  on  DSM-IV; 

SCID-DSM-IV;  Overview  of  SCID-DSM-III-R 
and  Diagnostic  Questions  from 

PRIME-MD,  Major  Depressive  Disorder^ 

Wittkampf  et  al  (2007)50 

Bumam-3A 

81 

95 

3,1 161' 

DIS-DSM-III,  Major  Depressive  Episode  or 
Dysthymia,  Past  Year 

Rost  et  al  (1993)43 

Bumam-8 

77 

97 

3,1 16^ 

DIS-DSM-III,  Major  Depressive  Episode  or 
Dysthymia,  Past  Year 

Rost  et  al  (1993)43 

20  Item  CES-D 

80 

71 

425* 

SCID-DSM-III-R,  Major  Depressive 

Disorder,  Past  Month 

Fechner-Bates  et  al  (1994)51 

“PRIME-MD,  Primary  Care  Evaluation  of  Mental  Disorders;  SCID,  Structured  Clinical  Interview  for  DSM-IV/DSM-III-R;  DIS,  Diagnostic  Interview 
Schedule.  The  PHQ-8  and  PHQ-9  with  severity  coding  scheme  have  very  similar  operating  characteristics52  and  are  highly  correlated53  and  are  thus  treated 
as  equivalent.  To  our  knowledge,  there  are  no  studies  that  examined  the  sensitivity  and  specificity  of  the  PHQ-9/DSM-IV  plus  self-reported  functional 
impairment  at  the  cut  point  used  in  this  study. 

5  Data  were  collected  in  primary  care  clinics. 

‘Data  were  collected  in  a  community  sample. 

^Pooled  results  from  4  studies  that  compared  the  PHQ-9/DSM-IV  with  various  structured  interviews  using  a  random  effects  model.  Only  1  study  provided  the 
time  frame  of  the  criterion,  which  was  past  month. 


in  civilian  populations,  though,  mainly  in  primary  care  sam¬ 
ples,  making  it  unclear  whether  the  estimates  of  sensitivity 
and  specificity  in  these  studies  generalize  to  nonpatient  sam¬ 
ples  of  military  personnel. 

It  is  possible  to  adjust  prevalence  estimates  in  a  screening 
scale  to  approximate  estimates  of  “true”  prevalence  if  infor¬ 
mation  is  available  on  the  sensitivity  and  specificity  of  the 
screening  scale.  For  example,  if  we  know  that  sensitivity  is 
50%  (i.e.,  half  of  the  true  cases  are  detected  by  the  screening 
scale)  and  specificity  is  100%  (i.e.,  all  the  true  noncases  are 
classified  as  noncases  by  the  screening  scale),  then  the  esti¬ 
mated  prevalence  is  only  50%  as  high  as  the  true  prevalence, 
meaning  that  the  best  estimate  of  true  prevalence  is  two  times 
the  prevalence  estimate  in  the  screening  scale.  A  standard 
formula  exists  for  converting  prevalence  estimates  in  screen¬ 
ing  scales  to  estimates  of  true  prevalence  based  on  informa¬ 
tion  about  sensitivity  and  specificity.54  We  used  that  formula 
to  adjust  the  prevalence  estimates  based  on  the  screening 
scales  in  each  of  the  studies  reported  in  Table  II.  The  esti¬ 
mates  of  sensitivity  and  specificity  used  in  doing  this  were 
the  published  estimates  of  sensitivity  and  specificity  for  these 
screening  scales.  However,  these  transformations  yielded 
implausible  estimates  of  true  MD  prevalence.  This  was  most 
clear  in  quite  a  few  studies  where  estimated  prevalence  was 
strongly  negative  (i.e.,  not  merely  within  sampling  error  of 
0.0%,  but  substantially  less  than  0.0%). 

A  negative  prevalence,  of  course,  is  impossible.  So,  what 
does  it  mean  to  find  that  adjusted  prevalence  estimates  are 
negative?  (Detailed  results  of  this  estimation  exercise  are 


available  on  request  but  are  not  presented  here  because  so 
many  of  the  estimates  are  implausible.)  It  means,  quite  sim¬ 
ply,  that  the  sensitivity  and  specificity  estimates  in  the 
published  validity  studies  of  the  screening  scales  do  not  apply 
to  the  samples  considered  here.  That  is,  the  true  sensitivity 
and  specificity  of  the  screening  scales  in  the  studies  where 
they  were  used  must  have  been  different  than  the  sensitivity 
and  specificity  estimated  in  the  methodological  studies  of 
the  screening  scales.  There  are  a  variety  of  reasons  why  this 
might  be  the  case,  but  the  most  plausible  one  is  that  the 
samples  used  in  the  original  validity  studies  of  the  screening 
scales  might  have  been  different  than  those  in  the  substantive 
studies  reported  in  Table  II  (e.g.,  more  severe  cases  of  MD, 
which  would  lead  to  differences  in  rates  of  detection).  We 
have  no  way  to  produce  more  accurate  estimates  of  sensitiv¬ 
ity  and  specificity  for  the  studies  in  Table  II,  as  these  studies 
did  not  include  the  blinded  clinical  reappraisal  interviews 
with  probability  subsamples  of  respondents  that  would  be 
required  to  calculate  independent  estimates  of  sensitivity 
and  specificity  for  these  specific  studies.  Based  on  these  facts, 
we  abandoned  the  attempt  to  correct  prevalence  estimates 
in  these  studies  for  differential  sensitivity  and  specificity. 

Yet,  the  substantial  variation  in  prevalence  estimates 
across  these  studies  raises  the  possibility  that  between- 
measure  variation  in  concordance  with  clinical  diagnoses 
could  be  important.  In  the  absence  of  being  able  to  correct 
for  this  variation  by  using  sensitivity-specificity  adjustments, 
we  turned  to  a  different  method:  calibration  of  the  prevalence 
estimates  across  studies  to  a  common  metric  by  making  use 
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of  the  fact  that  a  number  of  epidemiological  studies — some 
of  them  in  military  samples  and  others  in  civilian  samples — 
have  reported  MD  prevalence  estimates  based  on  two  or  more 
of  the  measures  used  in  the  37  observational  records  consid¬ 
ered  here.  Access  to  these  within-study  pairs  of  prevalence 
estimates  allowed  us  to  create  prevalence  ratios  to  transform 
prevalence  estimates  based  on  one  measure  to  estimates  based 
on  another  measure.  The  common  metric  we  transformed  to 
was  the  DSM-IV  coding  scheme  for  the  PHQ-9.  The  latter 
scheme,  used  in  five  of  the  25  studies  considered  here,  requires 
at  least  five  of  the  nine  PHQ-9  questions  about  criterion  A 
symptoms  of  MD  to  be  reported  as  having  occurred  more 
than  half  the  days  over  the  recall  period  and  for  at  least  one 
of  these  questions  to  involve  either  depressed  mood  or  anhe- 
donia.  One  study  reported  prevalence  estimates  based  on  this 
PHQ-9/DSM-IV  coding  scheme  as  well  as  on  the  PHQ-9 
severity  coding  scheme  and  the  PHQ-2.55  Three  separate 
articles  from  a  second  study  reported  prevalence  estimates 
based  on  this  same  set  of  three  coding  schemes.34'40'42  Two 
other  studies  reported  prevalence  estimates  based  on  both 
the  PHQ-9/DSM-IV  and  the  version  of  the  PHQ-9  coding 
scheme  that  requires  impairment.13,25  We  used  the  weighted 
average  ratios  of  prevalence  estimates  based  on  these 
alternative  coding  schemes  to  transform  prevalence  estimates 
based  on  other  measures  to  PHQ-9/DSM-IV  prevalence  esti¬ 
mates.  In  cases  where  no  study  existed  that  presented  preva¬ 
lence  estimates  based  on  both  the  PHQ-9/DSM-IV  coding 
scheme  and  one  of  the  other  measures,  we  made  indirect 
calibrations  using  a  third  measure.  For  example,  although 


there  were  no  studies  that  included  prevalence  estimates 
based  on  both  the  CES-D  and  the  PHQ-9/DSM-IV  coding 
scheme,  one  study  presented  both  CES-D  and  PHQ-2  preva¬ 
lence  estimates,56  whereas  two  others  presented  both  PHQ-2 
and  PHQ-9/DSM-IV  prevalence  estimates,34’40’55  allowing 
us  to  multiply  these  two  separate  ratios  together  to  generate 
a  synthetic  CES-D  versus  PHQ-9/DSM-IV  calibration  ratio 
to  adjust  prevalence  estimates  in  the  one  study  that  used 
the  CES-D  to  estimate  depression  prevalence. 

As  shown  in  Figure  1,  this  calibration  exercise  substan¬ 
tially  reduces  the  average  prevalence  estimates  from  median 
and  mean  of  6.1  and  8.0%  in  the  weighted  raw  data  to  3.0 
and  4.5%  in  the  weighted  calibrated  data  as  well  as  in  the 
IQR  (from  6.5%  [between  3.8  and  10.3%]  in  the  raw  data 
to  2.7%  [between  2.3  and  5.0%]  in  the  calibrated  data). 

Multiple  Regression  Analysis 

The  test  for  the  variance  of  random  intercepts  is  significant 
in  the  random  effects  model  of  the  raw  outcomes  (%2i  =  13.0, 
p  <  0.001),  indicating  significant  heterogeneity  among  obser¬ 
vations.  This  supports  the  decision  to  use  the  random  effects 
model.  Methodological  and  substantive  variables  are  both 
significant  predictors  of  variation  in  MD  prevalence  esti¬ 
mates  across  the  37  raw  observational  records  (Table  III). 
With  regard  to  methodological  factors,  prevalence  estimates 
differ  significantly  by  type  of  screening  scale  (j27  =  37.1, 
p  <  0.001)  and  are  significantly  higher  in  anonymous  than 
identified  surveys  (odds  ratio  [OR]  =  3.1,  t  =  4.2 ,  p  =  0.002). 
With  regard  to  substantive  factors,  prevalence  estimates 


TABLE  III.  Association  Between  Anonymity  of  Survey,  Deployment  Status  and  Type  of  Measure  and  Prevalence  of  MD  Using  Random 

Effects  Models  with  Logit  Links 


Model  A.  DV:  Prevalence  as 
Reported  in  Articles" 

Model  B.  DV:  Prevalence  Adjusted 

Based  on  Calibration  of  Measures" 

Model  C.  Model  B  Without 

Dummies  for  Measures 

OR 

(95%  Cl) 

OR 

(95%  Cl) 

OR 

(95%  Cl) 

Intercept 

0.0* 

(O.O-O.O) 

0.0* 

(O.O-O.O) 

0.0* 

(O.O-O.O) 

Anonymous 

Yes 

3.1* 

(1. 7-5.6) 

2.9* 

(1 .6— 5.3) 

3.0* 

(1.9M.8) 

No 

1.0 

— 

1.0 

— 

1.0 

— 

Measure 

3-Item  Bumam 

5.9* 

(3.0-11.5) 

1.0 

(0.5-1 .9) 

8-Item  Bumam 

6.3* 

(2.5-16.2) 

0.9 

(0.6-3. 8) 

CES-D 

24.9* 

(8.2-75.4) 

3.2* 

(1. 1-9.6) 

PHQ-2 

2.9* 

(1. 1-7.5) 

0.6 

(0.2-1 .6) 

PHQ-8 

5.0* 

(1.6-15.6) 

0.9 

(0.3-2.8) 

PHQ-9 

5.8* 

(2.5-13.3) 

0.9 

(0.4-2. 1) 

PHQ-9/DSM-IV 

1.9 

(1.0-3. 8) 

0.8 

(0.4-1 .6) 

PHQ-9/DSM-IV+FI 

1.0 

— 

1.0 

— 

1.0 

— 

Deployment  Status 

Currently 

2.7* 

(1.4-5. 2) 

2.2* 

(1.2 — 4.1) 

2.3* 

(1.4-3.6) 

Previously 

3.2* 

(2.6-3. 9) 

2.5* 

(2.0-3. 1) 

2.5* 

(2.0-3. 1) 

Never 

1.0 

— 

1.0 

— 

1.0 

— 

DV,  Dependent  variable;  FI,  Functional  impairment. 

"The  8  coefficients  associated  with  type  of  measure  differ  significantly  from  each  other  in  Model  A  (/27  =  31.1,  p  <  0.001)  but  not  Model  B  {x2i  =  2.2,  p  =  0.95), 
suggesting  that  the  calibration,  which  was  based  on  data  independent  of  the  studies  analyzed  here,  succeeded  in  correcting  for  between-scale  differences  in 
concordance  with  clinical  diagnoses. 

*p  <  0.05. 
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are  significantly  higher  among  the  currently  deployed  (OR  = 
2.7,  t  =  3.4,  p  =  0.006)  and  previously  deployed  (OR  =  3.2, 
t=  12.7,  p  <  0.001)  than  the  never  deployed.  It  is  important  to 
recognize  that  results  are  based  on  a  multivariate  analysis, 
which  means  that  ORs  for  each  predictor  are  net  of  those  for 
other  predictors. 

The  results  are  different  when  the  same  model  is  used 
to  predict  variation  in  recalibrated  MD  prevalence  estimates. 
The  most  dramatic  difference  is  that  estimated  prevalence  is 
no  longer  predicted  significantly  by  type  of  screening  scale 
(/27  =  2.2,  p  =  0.95),  indicating  that  the  recalibration  exer¬ 
cise  was  successful.  However,  survey  anonymity  remains 
significantly  associated  with  elevated  prevalence  (OR  =  2.9, 
t  =  4.0,  p  =  0.002).  Furthermore,  the  currently  deployed  and 
previously  deployed  continue  to  have  significantly  higher 
prevalence  estimates  than  the  never  deployed,  although  these 
ORs  are  somewhat  lower  than  when  the  model  is  estimated 
on  the  raw  data  (OR  =  2.2,  t  =  2.7 ,  p  =  0.020  for  currently 
deployed;  OR  =  2.5,  /  =  9.8,  p  <  0.001  for  previously 
deployed).  These  significant  ORs  change  only  modestly  in  a 
model  that  deletes  predictors  for  type  of  screening  scale. 

MD  Prevalence  Estimates  Based  on  the  Best-Fitting  Regression  Model 

Based  on  the  assumption  that  the  higher  MD  prevalence 
estimates  in  anonymous  surveys  are  more  accurate  than  the 
lower  estimates  in  nonanonymous  surveys,  the  parameters  of 
the  best-fitting  random-effects  model  for  the  calibrated  data 
were  used  to  generate  best  estimates  of  MD  prevalence  for 
anonymous  surveys.  As  noted  above  in  the  section  on  analy¬ 
sis  methods,  the  jackknife  resampling  method  was  used  to 
generate  estimates  of  standard  error  (SE).  Best  estimates  of 
current  MD  prevalence  (SE)  are  12.0%  (1.2)  for  the  currently 
deployed,  13.1%  (1.8)  for  the  previously  deployed,  and  5.7% 
(1.2)  for  the  never  deployed. 

Other  Correlates  ofMD 

A  number  of  surveys  report  MD  prevalence  by  socio¬ 
demographic  variables  and/or  by  branch  of  service.  We  cal¬ 
culated  ORs  for  these  estimates  within  studies  and  then 
summarized  these  results  by  computing  weighted  (by  sample 
size)  averages  of  ORs  across  studies.  Women  are  found  con¬ 
sistently  to  have  higher  rates  of  MD  than  men  with  a  mean 
(range)  OR  of  1.6  (1.1-1. 9)  across  studies  (Table  IV).  Preva¬ 
lence  also  is  higher  among  respondents  with  no  more  than 
high  school  education  (3.0  [2.0-3. 6])  or  some  college  educa¬ 
tion  (1.8  [1. 6-2.1])  relative  to  college  graduates.  Prevalence 
generally  is  unrelated  to  race-ethnicity.  Prevalence  is  consis¬ 
tently  higher  among  enlisted  (2.8  LI. 9-3. 6])  personnel  than 
warrant  officers  (1.1  [0.9-1. 2])  or  commissioned  officers  (the 
contrast  category,  with  an  implicit  OR  of  1.0).  In  addition, 
MD  generally  is  estimated  to  be  more  common  among  youn¬ 
ger  (up  to  ages  24  or  25)  than  older  (older  than  24  or  25) 
respondents  (2.0  [1.0-2. 2])  and  among  the  unmarried  (either 
never  married  or  previously  married)  than  the  married  (1.8 
[1. 0-2.0]).  The  studies  that  compared  MD  across  services 


report  consistently  higher  prevalence  in  the  Army  (2.0  [1.6- 
2.1]),  Navy  (1.7  [1.3— 1.8]),  and  Marines  (2.0  [1. 4-2.3])  than 
the  Air  Force. 

The  Simulation 

Current  Depression  Prevalence  Estimates  in  the  Simulation  Data 
A  question  can  be  raised  how  the  prevalence  estimates 
reported  above  for  current  MD  compare  to  the  general  U.S. 
population.  The  comparable  prevalence  estimate  (SE)  in  the 
simulated  NCS-R  data  is  1.3%  (0.6).  To  be  clear,  this  is  the 
rate  we  would  expect  in  a  representative  sample  of  people  in 
the  U.S.  population  who  have  the  same  sociodemographic 
profile  (e.g.,  age,  sex,  race-ethnicity,  and  education)  and 
history  of  pre-enlistment  health  problems  as  the  members  of 
the  U.S.  Army.  The  1.3%  MD  prevalence  estimate  is  substan¬ 
tially  lower  than  the  estimates  reported  above  in  the  meta¬ 
analysis.  Even  though  the  NCS-R  simulation  uses  a  different 
measure  of  MD  than  any  of  the  meta-analysis  surveys,  the 
NCS-R  measure  has  been  validated  in  the  general  population 
and  shown  to  yield  a  prevalence  estimate  very  similar  to  the 
estimate  based  on  blinded  clinical  reappraisal  interviews 
using  DSM-IV  criteria.2  We  would  consequently  expect  that 
the  meta-analysis  estimates,  based  on  calibration  to  the  PHQ-9/ 
DSM-IV,  would  be  comparable  to  the  simulated  NCS-R/ 
DSM-IV  estimates. 

Lifetime  Depression  Prevalence  Estimates  in  the  Simulation  Data 
As  noted  in  the  introduction,  much  less  is  known  about  life¬ 
time  prevalence  than  current  prevalence  of  MD  among  mili¬ 
tary  personnel.  The  simulated  NCS-R  data  show  that  16.2% 
of  the  sample  has  a  lifetime  history  of  MD,  that  69.7% 
of  these  lifetime  cases  (i.e.,  11.3%  of  the  total  sample)  had 
first  onsets  before  enlistment  (i.e.,  before  the  age  when  we 
would  have  expected  them  to  enlist  based  on  their  socio¬ 
demographic  profile),  and  that  the  remaining  30.3%  of  life¬ 
time  cases  (i.e.,  5.5%  of  the  total  sample)  had  first  onsets  only 
after  enlistment  (Table  V).  The  majority  of  those  with  current 
MD  had  first  onsets  before  enlistment  (77.9%).  The  latter  is 
higher  than  the  69.7%  of  lifetime  cases  with  pre-enlistment 
MD,  suggesting  that  MD  persistence  is  higher  among  early- 
onset  than  later-onset  cases.  This  higher  persistence  is  indi¬ 
rectly  indicated  by  the  fact  that  the  ratio  of  current  to  lifetime 
prevalence  is  higher  among  respondents  with  pre-enlistment 
(8.8%)  than  postenlistment  (5.4%)  MD. 

Given  the  much  higher  prevalence  of  current  MD  among 
Army  personnel  than  expected  from  the  simulations,  it  is 
unclear  from  these  data  what  percent  of  actual  Army  person¬ 
nel  with  current  MD  had  first  onsets  before  enlistment.  The 
high  current  prevalence  estimates  from  the  meta-analysis 
could  reflect  either  a  dramatic  increase  in  current  prevalence 
among  lifetime  cases  once  they  enter  the  Army,  a  dramatic 
increase  in  postenlistment  first  onset,  or  a  combination.  If  it 
is  true  that  1 1.3%  of  actual  Army  personnel  had  a  history  of 
MD  before  enlistment,  and  if  postenlistment  onsets  were 
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TABLE  IV.  Sociodemographic  Correlates  of  MD:  Weighted  Average  and  Range  Across  Studies  of  Within-Study  ORs 


Weighted  Average 

Minimum 

Maximum 

Number  of  Studies 

(n) 

Gender" 

8 

Female 

1.6 

i.i 

1.9 

42,982 

Male 

1.0 

135,194 

Race-Ethnicity6 

7 

African  American,  non-Hispanic 

1.1 

0.7 

1.4 

26,617 

Hispanic 

1.1 

0.9 

1.3 

23,572 

Other 

1.2 

1.0 

1.3 

7,433 

White,  non-Hispanic 

1.0 

116,289 

Education 

6 

High  School  or  Less 

3.0 

2.0 

3.6 

63,616 

Some  College 

1.8 

1.6 

2.1 

60,529 

College  Graduate  or  Higher 

1.0 

43,236 

Agec 

8 

24/25  or  Younger 

2.0 

1.0 

2.2 

53,022 

25/26  or  Older 

1.0 

125,135 

Marital  Status'* 

7 

Not  Married 

1.8 

1.0 

2.0 

65,596 

Married 

1.0 

105,028 

Rank 

6 

Enlisted 

2.8 

1.9 

3.6 

129,648 

Warrant  Officer 

1.1 

0.9 

1.2 

4,319 

Commissioned  Officer 

1.0 

33,414 

Service^ 

6 

Army 

2.0 

1.6 

2.1 

58,279 

Navy 

1.7 

1.3 

1.8 

36,942 

Marine  Corps 

2.0 

1.4 

2.3 

22,985 

Air  Force 

1.0 

45,319 

Between  6  and  8  studies  were  used  to  examine  each  of  the  7  correlates.  Six  studies  were  the  same  for  all  7  correlates.14-19  The  other  studies  used  varied  across 
correlates,  as  described  in  the  following  notes.  Within  each  study  ORs  were  calculated  and  then  a  weighted  average  OR  was  calculated  across  studies. 

"For  gender,  we  used  the  postdeployment  sample  of  Luxton  et  al30  and  the  combined  3-  and  12-month  assessment  of  Riviere  et  al.12 

"The  race-ethnicity  categories  reported  here  are  the  ones  used  in  the  largest  set  of  studies.1  -  9  One  other  study  used  the  categories  “Caucasian,”  “African 
American,”  “Hispanic,”  and  “Asian”  (we  coded  the  latter  as  “other”),24  whereas  another  used  the  categories  “White  non-Hispanic,”  “Black  non-Hispanic,” 
and  “other.”  4  In  this  case,  we  coded  the  category  “other”  as  “Hispanic”  because  the  majority  of  this  group  is  assumed  to  be  Hispanic. 
cThe  largest  set  of  studies  used  here15-19  provided  information  on  age  <25  and  >26,  whereas  the  other  studies12,14’24  provided  information  on  age  <24 
and  >25. 

^Also  based  on  Riviere  et  al.12  “Not  married”  includes  “never  married,”  “divorced/widowed,”  and  “single”  (and  “separated”  in  the  largest  set  of 
studies,15-19  this  is  unclear  for  the  other  studies).  In  one  study,15  “married”  includes  living  in  a  marriage-like  relationship,  whereas  in  others16-19  only 
legally  married  personnel  were  included  as  “married”  (this  is  unclear  for  the  rest  of  the  studies). 

‘  In  Riddle  et  al,14  Navy  and  Coast  Guard  are  combined. 


TABLE  V.  Simulated  Lifetime,  12-Month  and  Past  30  Days  Prevalence  Estimates  of  DSM-IV/CIDI  Major  Depressive  Episode  and 
Dysthymic  Disorder  in  the  Subsample  of  NCS-R  Respondents  Weighted  to  Approximate  the  Population  of  Active  duty  Army  Personnel 

(n  =  1785) 


Proportion  of  Prevalence 

Prevalence  of  Disorder  That  Prevalence  of  Disorder  That  Because  of  Disorder  That 

Prevalence  Began  Before  Enlistment  Began  Only  After  Enlistment  Began  Before  Enlistment 


% 

(SE) 

% 

(SE) 

% 

(SE) 

% 

(SE) 

I.  Lifetime  Prevalence 

16.2 

(3.1) 

11.3 

(2.7) 

5.5 

(1.7) 

69.7 

(8.5) 

II.  12-Month  Prevalence 

6.3 

(2.1) 

5.0 

(2.1) 

1.4 

(0.3) 

79.8 

(7.9) 

III.  Past  30  Days  Prevalence 

1.3 

(0.6) 

1.0 

(0.5) 

0.3 

(0.1) 

77.9 

(11.3) 

Respondents  are  all  ages  18  to  65  years,  have  at  least  a  high  school  education,  and  are  employed  with  health  insurance  to  match  the  broad  sociodemographic 
profile  of  Army  personnel.  All  NCS-R  respondents  who  were  ever  convicted  of  a  felony  or  served  at  least  1  year  in  prison  were  excluded  from  the  sample. 
All  NCS-R  respondents  with  handicaps  or  physical  or  mental  disorders  that  would  normally  lead  to  rejection  from  Army  enlistment  or  discharge  were 
excluded  from  the  sample.  The  handicaps  included  deafness,  blindness,  paralysis  (of  one  or  both  arms,  legs  or  sides  of  the  body),  and  a  missing  limb  (hand, 
foot,  arm  or  leg).  The  physical  disorders  included  cardiovascular  disorders  (heart  attack,  stroke,  hypertension,  heart  disease),  respiratory  disorders  (COPD, 
asthma),  diabetes,  ulcer,  HIV  or  AIDS,  epilepsy  or  seizure  disorder,  Crohn’s  disease,  cancer  (except  skin  cancer),  severe  migraines,  and  extreme  obesity.  The 
mental  disorders  included  schizophrenia  and  other  nonaffective  psychoses,  BP-I  disorder,  and  serious  suicide  attempts  that  occurred  before  the  imputed  age 
of  enlistment. 
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double  the  estimate  in  the  simulation  data  (i.e.,  1 1.0%  rather 
than  5.5%),  then  the  ratio  of  current  to  lifetime  prevalence 
would  have  to  be  at  least  25%  among  the  never  deployed 
(5.7/22.3)  and  50%  among  the  currently  (12.0/22.3)  and  pre¬ 
viously  (13.1/22.3)  deployed  to  generate  the  estimates  of 
current  prevalence  found  in  the  meta-analysis  data. 

DISCUSSION 

The  meta-analysis  reported  here  was  limited  by  the  fact  that  a 
wide  range  of  MD  screening  scales  were  used  in  the  different 
studies  and  by  the  fact  that  prevalence  estimates  vary  signif¬ 
icantly  by  type  of  screening  scale.  We  attempted  to  address 
this  problem  by  transforming  the  screening  scale  prevalence 
estimates  to  a  common  metric  based  on  the  results  of  pub¬ 
lished  validity  studies.  This  attempt  failed,  though,  as  some 
“corrected”  prevalence  estimates  were  less  than  zero.  This 
means  that  the  sensitivity  and  specificity  estimates  reported 
in  the  published  validity  studies,  all  of  which  were  carried  out 
in  civilian  samples  and  mostly  among  primary  care  patients, 
do  not  apply  to  the  military  samples  considered  here.  The 
calibration  approach  we  used  to  address  this  problem  was 
limited  by  the  fact  that  it  required  the  assumption  that  preva¬ 
lence  ratios  across  different  screening  scales  in  a  single  sur¬ 
vey  could  legitimately  be  generalized  across  surveys.  Future 
epidemiological  studies  of  depression  in  the  military  should 
address  this  problem  more  directly  by  using  the  same  screen¬ 
ing  measure.  This  would  be  consistent  with  recent  recom¬ 
mendations  for  use  of  common  data  elements  in  surveillance 
and  operational  stress  research. 17 

It  is  noteworthy  that  none  of  the  screening  scales  included 
an  exclusion  for  bipolar  disorder  (BPD).  This  means  that  they 
screened  for  major  depressive  episodes  (MDE),  not  for  major 
depressive  disorder  (MDD),  and  that  some  unknown  propor¬ 
tion  of  these  cases  represents  depressive  phases  of  a  BPD.  As 
bipolar-I  (BP-I)  leads  to  military  discharge  and  is  so  dramatic 
during  the  manic  phase  that  it  is  likely  to  have  a  high  rate 
of  detection,  we  would  not  expect  many  cases  of  MD  in  the 
military  to  be  associated  with  BP-I.  But,  BP-II  and  subthresh¬ 
old  BPD  are  together  much  more  common  than  BP-I3  and 
often  go  undetected.  Intervention  implications  are  quite  dif¬ 
ferent  for  BPD  than  MDD,  making  it  important  to  distinguish 
between  the  two.  Future  epidemiological  studies  of  depres¬ 
sion  in  the  military  should  address  this  problem  by  including 
a  BPD  screen  and  the  MD  screen.  This  is  being  done  in  the 
Army  STARRS  study,  but  we  are  unaware  of  any  other  large- 
scale  military  epidemiological  survey  that  has  done  so. 

It  would  also  be  useful  to  include  a  small  clinical 
reappraisal  component  in  each  future  major  epidemiological 
survey  of  military  mental  health  even  if  a  consistent  MD 
screening  scale  was  used.  Repeated  validity  studies  are 
needed  because  variation  in  the  accuracy  of  any  screening 
scale  across  studies  can  be  influenced  by  survey  conditions 
(e.g.,  anonymity,  rationale,  the  context  created  by  preceding 
survey  questions,  and  the  physical  conditions  of  respondents  at 
the  time  of  survey  implementation)  that  vary  across  studies.58 


Another  methodological  feature  that  could  usefully  be 
added  to  future  military  epidemiological  studies  would  be  a 
nonrespondent  adjustment  process.  This  could  include  a 
nonrespondent  subsampling  outreach  phase  in  which  limited 
information  is  obtained  from  a  probability  subsample  of  sur¬ 
vey  nonrespondents.  Or  it  could  use  administrative  databases 
(e.g.,  information  from  military  electronic  medical  records 
about  history  of  diagnoses  of  mental  illness)  to  weight  the 
survey  data  for  underrepresentation  of  personnel  with  profiles 
associated  with  high  risk  of  MD  or  other  mental  disorders. 
Methods  of  these  sorts  have  been  used  successfully  to  address 
sample  bias  in  other  epidemiological  surveys.59  Weighting 
seems  like  an  especially  attractive  approach  in  military  sur¬ 
veys  in  light  of  the  existence  of  an  extensive  series  of  admin¬ 
istrative  databases  for  all  military  personnel. 

A  related  limitation  of  our  meta-analysis  is  that  the  sam¬ 
ples,  although  largely  based  on  probability  selection  methods 
within  the  units  studied,  often  used  nonprobability  methods 
to  select  units  and,  within  units,  to  select  critical  times  in  the 
unit  life  cycle.  This  led  to  over-representation  of  combat  units 
as  well  as  to  over-representation  of  the  months  just  before 
deployment  and  just  after  redeployment.  Although  it  would 
theoretically  be  possibly  to  correct  for  these  sampling  biases 
with  weights,  the  logistical  complexities  of  doing  so  made 
this  impossible  in  practice.  As  a  result,  caution  is  needed 
in  drawing  inferences  from  our  summary  results  because  of 
the  likely  skewed  distributions  in  our  samples  of  military 
occupation  specialties  (MOSs),  units,  and  timing  of  deploy¬ 
ment  histories.  Not  all  of  the  samples  considered  in  our  meta¬ 
analysis  shared  this  last  limitation,  as  some  surveys  were 
representative  of  all  military  personnel  in  one  or  more 
branches  of  service.  However,  in  order  to  use  the  data  from 
these  studies  to  adjust  the  results  across  all  studies,  we  would 
have  needed  to  work  with  individual-level  data  rather  than 
the  aggregate  data  available  to  us.  This  highlights  another 
limitation  of  our  meta-analysis;  that  it  was  based  on  summary 
published  results  rather  than  on  secondary  analysis  of  indi¬ 
vidual-level  data.  More  fine-grained  analysis  could  have  been 
carried  out  in  individual-level  secondary  analysis,  including 
but  not  limited  to,  using  weights  to  adjust  sample  composi¬ 
tion  for  the  over-representation  of  some  MOSs,  types  of 
units,  and  deployment  histories.  Pooled  secondary  analysis 
of  existing  survey  data  has  been  of  great  value  in  advancing 
our  understanding  of  the  epidemiology  of  depression  in  the 
general  population.60  The  same  could  be  true  for  research 
on  depression  (and  other  mental  disorders)  in  the  military  if 
de-identified  individual-level  data  were  made  available. 

Our  simulation  had  only  a  limited  set  of  variables,  some 
of  them  relatively  coarse,  to  match  the  NCS-R  national 
household  sample  with  the  characteristics  of  Army  personnel. 
Failure  to  control  for  the  many  unmeasured  selection  factors 
that  might  influence  both  enlistment  in  the  Army  and  depres¬ 
sion  could  have  distorted  the  results.  In  addition,  the  simula¬ 
tion  was  designed  to  provide  data  on  what  the  prevalence 
and  age-of-onset  distribution  of  depression  would  have  been 
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expected  to  be  among  Army  personnel  if  they  had  not 
joined  the  Army.  Although  that  kind  of  information  is  poten¬ 
tially  useful  in  assessing  the  impact  of  Army  experiences, 
in  the  context  of  the  much  higher  current  prevalence  esti¬ 
mates  in  the  meta-analysis  than  the  simulation,  it  tells  us 
nothing  about  the  lifetime  prevalence  of  postenlistment  onset 
depression  or  about  the  persistence  of  either  pre-enlistment  or 
postenlistment  depression. 

Our  estimates  of  current  MD  prevalence  in  the  military 
are  much  higher  than  the  30-day  prevalence  estimate  obtained 
for  sociodemographically  comparable  civilians  in  the  simula¬ 
tion  study.  It  needs  to  be  noted  that  the  MD  prevalence  esti¬ 
mates  from  the  meta-analysis  were  generated  based  on  the 
parameters  for  anonymous  surveys,  whereas  the  simulation 
results  are  based  on  confidential  (but  not  anonymous)  inter¬ 
views  from  a  general  population  survey.  Previous  research 
indicates  that  respondents  are  more  likely  to  provide  accurate 
information  on  sensitive  questions  in  anonymous  versus  con¬ 
fidential  questionnaires,61  but  this  would  explain  only  a  small 
part  of  the  difference  in  MD  prevalence  estimates.  The  finding 
that  the  prevalence  estimate  was  higher  for  the  previously  than 
currently  deployed  could  be  an  artifact  in  that  the  previously 
deployed  personnel  considered  here  over-represented  those 
who  had  recently  returned  from  deployment.  Current  preva¬ 
lence  among  military  personnel  was  estimated  to  be  higher  for 
women  than  men,  young  than  old,  the  unmarried  than  the 
married,  and  those  with  lower  than  higher  rank  and  education. 
These  correlates  are  broadly  consistent  with  those  found  in 
general  population  surveys.2'62 

We  estimated  that  16.2%  of  respondents  in  the  simulation 
data  had  a  lifetime  history  of  MD  and  that  the  majority 
(69.7%)  of  these  lifetime  cases  had  first  onsets  before 
expected  time  of  enlistment.  We  have  no  comparable  lifetime 
prevalence  estimate  in  the  meta-analysis  data  although  we 
would  expect  that  lifetime  prevalence  among  military  person¬ 
nel  would  be  higher  because  of  a  presumed  larger  number  of 
postenlistment  onsets  than  at  comparable  ages  in  the  general 
population.  In  the  absence  of  a  direct  estimate  of  persistence, 
though,  we  have  no  way  to  know  how  much  higher  the 
prevalence  of  postenlistment  onsets  are  in  the  military  than 
the  general  population  or  the  proportion  of  current  cases  that 
had  first  onsets  before  enlistment.  However,  the  high  esti¬ 
mated  pre-enlistment  lifetime  prevalence  in  the  simulation 
data,  when  coupled  with  the  finding  that  early  onset  is  posi¬ 
tively  associated  with  persistence,  leads  us  to  speculate  that 
a  substantial  minority  or  perhaps  even  a  majority  of  military 
personnel  with  current  depression  had  first  onsets  before 
enlistment.  To  the  extent  that  this  is  true,  secondary  preven¬ 
tive  interventions  with  recruits  having  a  pre-enlistment  his¬ 
tory  of  depression  (or  other  mental  disorders  that  predict 
subsequent  depression)  might  be  effective  in  reducing  inci¬ 
dence  of  subsequent  depressive  episodes  among  military 
personnel.  More  direct  data  would  be  needed,  though,  on 
lifetime  history,  age-of-onset,  current  prevalence,  and  sever¬ 
ity  of  current  depression  in  representative  military  samples 
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before  any  such  interventions  could  reasonably  be  planned. 
The  Army  STARRS  study  is  collecting  such  data  for  the 
Army,  but  we  are  unaware  of  any  attempt  to  collect  compa¬ 
rable  data  for  other  branches  of  the  military. 
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